RAW SEQUENCE LISTING

PATENT APPLICATION: US/09/913,494A



DATE: 12/11/2002 
TIME: 09:37:39 



Input Set : N:\Crf4\12022002\l913494.raw
Output Set: N:\CRF4\12112002\l913494A.raw

1 <110> APPLICANT: Merck Patent GmbH
2 <120> TITLE OF INVENTION: Glucose Dehydrogenase Fusion Proteins and their Utilization in Expression
3 Systems
4 <130> FILE REFERENCE: Merck 2289 / P9906920
5 <140> CURRENT APPLICATION NUMBER: US/09/913,494A
C--> 6 <141> CURRENT FILING DATE: 1999-02-19
7 <150> PRIOR APPLICATION NUMBER: DE 19906920
8 <151> PRIOR FILING DATE: 1999-02-19
9 <160> NUMBER OF SEQ ID NOS: 17
10 <170> SOFTWARE: PatentIn version 3.1

12 <210> SEQ ID NO: 1
13 <211> LENGTH: 3992
14 <212> TYPE: DNA
15 <213> ORGANISM: Bacillus megaterium
16 <220> FEATURE:
17 <221> NAME/KEY: CDS
18 <222> LOCATION: (186)..(968)
19 <223> OTHER INFORMATION: Glucose Dehydrogenase from Bacillus megaterium
20 <220> FEATURE:
21 <221> NAME/KEY: gene
22 <222> LOCATION: (1)..(3992)
23 <223> OTHER INFORMATION: plasmid PAW2
24 <220> FEATURE:
25 <221> NAME/KEY: CDS
26 <222> LOCATION: (978)..(1010)
27 <223> OTHER INFORMATION: poly-histidine tag
28 <400> SEQUENCE: 1
29 ccatcgaatg gccagatgat taattcctaa tttttgttga cactctatca ttgatagagt 60
30 tattttacca ctccctatca gtgatagaga aaagtgaaat gaatagttcg acaaaaatct 120
31 agataacgag ggcaatcgat gaattcgagc tcggtacccg gggatccctc gaggtcgacc 180
32 tgcag atg tat aca gat tta aaa gat aaa gta gtt gta att aca ggt gga 230
33 Met Tyr Thr Asp Leu Lys Asp Lys Val Val Val Ile Thr Gly Gly
34 1 5 10 15
35 tca aca ggt tta gga cgc gca atg gct gtt cgt ttc ggt caa gaa gaa 278
36 Ser Thr Gly Leu Gly Arg Ala Met Ala Val Arg Phe Gly Gln Glu Glu
37 20 25 30
38 gca aaa gtt gtt att aac tat tac aac aat gaa gaa gaa gct cta gat 326
39 Ala Lys Val Val Ile Asn Tyr Tyr Asn Asn Glu Glu Glu Ala Leu Asp
40 35 40 45
41 gcg aaa aaa gaa gta gaa gaa gca ggc gga caa gca atc atc gtt caa 374
42 Ala Lys Lys Glu Val Glu Glu Ala Gly Gly Gln Ala Ile Ile Val Gln
43 50 55 60
44 ggc gat gta aca aaa gaa gaa gac gtt gta aat ctt gtt caa aca gct 422
45 Gly Asp Val Thr Lys Glu Glu Asp Val Val Asn Leu Val Gln Thr Ala
46 65 70 75
47 att aaa gaa ttt ggt aca tta gac gta atg att aac aac gct ggt gtt 470
48 Ile Lys Glu Phe Gly Thr Leu Asp Val Met Ile Asn Asn Ala Gly Val
49 80 85 90 95
50 gaa aac cca gtt cct tct cat gag cta tct cta gat aac tgg aac aaa 518
51 Glu Asn Pro Val Pro Ser His Glu Leu Ser Leu Asp Asn Trp Asn Lys
52 100 105 110
53 gtt att gat aca aac tta aca ggt gca ttc tta gga agc cgt gaa gca 566
54 Val Ile Asp Thr Asn Leu Thr Gly Ala Phe Leu Gly Ser Arg Glu Ala
55 115 120 125
56 att aaa tac ttc gtt gaa aac gac att aaa gga aat gtt atc aac atg 614
57 Ile Lys Tyr Phe Val Glu Asn Asp Ile Lys Gly Asn Val Ile Asn Met
58 130 135 140
59 tct agc gtt cac gaa atg att cct tgg cca tta ttt gtt cac tac gca 662
60 Ser Ser Val His Glu Met Ile Pro Trp Pro Leu Phe Val His Tyr Ala
61 145 150 155
62 gca agt aaa ggc ggt atg aaa cta atg acg gaa aca ttg gct ctt gaa 710
63 Ala Ser Lys Gly Gly Met Lys Leu Met Thr Glu Thr Leu Ala Leu Glu
64 160 165 170 175
65 tat gcg cca aaa ggt atc cgc gta aat aat att gga cca ggt gcg atg 758
66 Tyr Ala Pro Lys Gly Ile Arg Val Asn Asn Ile Gly Pro Gly Ala Met
67 180 185 190
68 aac aca cca att aac gca gag aaa ttt gca gat cca gaa caa cgt gca 806
69 Asn Thr Pro Ile Asn Ala Glu Lys Phe Ala Asp Pro Glu Gln Arg Ala
70 195 200 205
71 gac gta gaa agc atg att cca atg ggt tac atc ggt aaa cca gaa gaa 854
72 Asp Val Glu Ser Met Ile Pro Met Gly Tyr Ile Gly Lys Pro Glu Glu
73 210 215 220
74 gta gca gca gtt gca gca ttc tta gct tca tca caa gca agc tat gta 902
75 Val Ala Ala Val Ala Ala Phe Leu Ala Ser Ser Gln Ala Ser Tyr Val
76 225 230 235
77 aca ggt att aca tta ttt gca gat ggc ggt atg acg aaa tac cct tct 950
78 Thr Gly Ile Thr Leu Phe Ala Asp Gly Gly Met Thr Lys Tyr Pro Ser
79 240 245 250 255
80 ttc caa gca gga aga ggc taatagagc gct atg aga gga tcg cat cac cat 1001
81 Phe Gln Ala Gly Arg Gly Ala Met Arg Gly Ser His His His
82 260 265
83 cac cat cac taatagaagc ttgacctgtg aagtgaaaaa tggcgcacat 1050
84 His His His
86 tgtgcgacat tttttttgtc tgccgtttac cgctactgcg tcacggatct ccacgcgccc 1110
87 tgtagcggcg cattaagcgc ggcgggtgtg gtggttacgc gcagcgtgac cgctacactt 1170
88 gccagcgccc tagcgcccgc tcctttcgct ttcttccctt cctttctcgc cacgttcgcc 1230
89 ggctttcccc gtcaagctct aaatcggggg ctccctttag ggttccgatt tagtgcttta 1290
90 cggcacctcg accccaaaaa acttgattag ggtgatggtt cacgtagtgg gccatcgccc 1350
91 tgatagacgg tttttcgccc tttgacgttg gagtccacgt tctttaatag tggactcttg 1410
92 ttccaaactg gaacaacact caaccctatc tcggtctatt cttttgattt ataagggatt 1470
93 ttgccgattt cggcctattg gttaaaaaat gagctgattt aacaaaaatt taacgcgaat 1530
94 tttaacaaaa tattaacgct tacaatttca ggtggcactt ttcggggaaa tgtgcgcgga 1590
95 acccctattt gtttattttt ctaaatacat tcaaatatgt atccgctcat gagacaataa 1650
96 ccctgataaa tgcttcaata atattgaaaa aggaagagta tgagtattca acatttccgt 1710
97 gtcgccctta ttcccttttt tgcggcattt tgccttcctg tttttgctca cccagaaacg 1770
98 ctggtgaaag taaaagatgc tgaagatcag ttgggtgcac gagtgggtta catcgaactg 1830
99 gatctcaaca gcggtaagat ccttgagagt tttcgccccg aagaacgttt tccaatgatg 1890
100 agcactttta aagttctgct atgtggcgcg gtattatccc gtattgacgc cgggcaagag 1950
101 caactcggtc gccgcataca ctattctcag aatgacttgg ttgagtactc accagtcaca 2010
102 gaaaagcatc ttacggatgg catgacagta agagaattat gcagtgctgc cataaccatg 2070
103 agtgataaca ctgcggccaa cttacttctg acaacgatcg gaggaccgaa ggagctaacc 2130
104 gcttttttgc acaacatggg ggatcatgta actcgccttg atcgttggga accggagctg 2190
105 aatgaagcca taccaaacga cgagcgtgac accacgatgc ctgtagcaat ggcaacaacg 2250
106 ttgcgcaaac tattaactgg cgaactactt actctagctt cccggcaaca attgatagac 2310
107 tggatggagg cggataaagt tgcaggacca cttctgcgct cggcccttcc ggctggctgg 2370
108 tttattgctg ataaatctgg agccggtgag cgtggctctc gcggtatcat tgcagcactg 2430
109 gggccagatg gtaagccctc ccgtatcgta gttatctaca cgacggggag tcaggcaact 2490
110 atggatgaac gaaatagaca gatcgctgag ataggtgcct cactgattaa gcattggtag 2550
111 gaattaatga tgtctcgttt agataaaagt aaagtgatta acagcgcatt agagctgctt 2610
112 aatgaggtcg gaatcgaagg tttaacaacc cgtaaactcg cccagaagct aggtgtagag 2670
113 cagcctacat tgtattggca tgtaaaaaat aagcgggctt tgctcgacgc cttagccatt 2730
114 gagatgttag ataggcacca tactcacttt tgccctttag aaggggaaag ctggcaagat 2790
115 tttttacgta ataacgctaa aagttttaga tgtgctttac taagtcatcg cgatggagca 2850
116 aaagtacatt taggtacacg gcctacagaa aaacagtatg aaactctcga aaatcaatta 2910
117 gcctttttat gccaacaagg tttttcacta gagaatgcat tatatgcact cagcgcagtg 2970
118 gggcatttta ctttaggttg cgtattggaa gatcaagagc atcaagtcgc taaagaagaa 3030
119 agggaaacac ctactactga tagtatgccg ccattattac gacaagctat cgaattattt 3090
120 gatcaccaag gtgcagagcc agccttctta ttcggccttg aattgatcat atgcggatta 3150
121 gaaaaacaac ttaaatgtga aagtgggtct taaaagcagc ataacctttt tccgtgatgg 3210
122 taacttcact agtttaaaag gatctaggtg aagatccttt ttgataatct catgaccaaa 3270
123 atcccttaac gtgagttttc gttccactga gcgtcagacc ccgtagaaaa gatcaaagga 3330
124 tcttcttgag atcctttttt tctgcgcgta atctgctgct tgcaaacaaa aaaaccaccg 3390
125 ctaccagcgg tggtttgttt gccggatcaa gagctaccaa ctctttttcc gaaggtaact 3450
126 ggcttcagca gagcgcagat accaaatact gtccttctag tgtagccgta gttaggccac 3510
127 cacttcaaga actctgtagc accgcctaca tacctcgctc tgctaatcct gttaccagtg 3570
128 gctgctgcca gtggcgataa gtcgtgtctt accgggttgg actcaagacg atagttaccg 3630
129 gataaggcgc agcggtcggg ctgaacgggg ggttcgtgca cacagcccag cttggagcga 3690
130 acgacctaca ccgaactgag atacctacag cgtgagctat gagaaagcgc cacgcttccc 3750
131 gaagggagaa aggcggacag gtatccggta agcggcaggg tcggaacagg agagcgcacg 3810
132 agggagcttc cagggggaaa cgcctggtat ctttatagtc ctgtcgggtt tcgccacctc 3870
133 tgacttgagc gtcgattttt gtgatgctcg tcaggggggc ggagcctatg gaaaaacgcc 3930
134 agcaacgcgg cctttttacg gttcctggcc ttttgctggc cttttgctca catgacccga 3990
135 ca 3992

137 <210> SEQ ID NO: 2
138 <211> LENGTH: 261
139 <212> TYPE: PRT
140 <213> ORGANISM: Bacillus megaterium
141 <400> SEQUENCE: 2
142 Met Tyr Thr Asp Leu Lys Asp Lys Val Val Val Ile Thr Gly Gly Ser
143 1 5 10 15
144 Thr Gly Leu Gly Arg Ala Met Ala Val Arg Phe Gly Gln Glu Glu Ala
145 20 25 30
146 Lys Val Val Ile Asn Tyr Tyr Asn Asn Glu Glu Glu Ala Leu Asp Ala
147 35 40 45
148 Lys Lys Glu Val Glu Glu Ala Gly Gly Gln Ala Ile Ile Val Gln Gly
149 50 55 60
150 Asp Val Thr Lys Glu Glu Asp Val Val Asn Leu Val Gln Thr Ala Ile
151 65 70 75
152 Lys Glu Phe Gly Thr Leu Asp Val Met Ile Asn Asn Ala Gly Val Glu
154 Asn Pro Val Pro Ser His Glu Leu Ser Leu Asp Asn Trp Asn Lys Val
155 100 105 110
156 Ile Asp Thr Asn Leu Thr Gly Ala Phe Leu Gly Ser Arg Glu Ala Ile
157 115 120 125
158 Lys Tyr Phe Val Glu Asn Asp Ile Lys Gly Asn Val Ile Asn Met Ser
159 130 135 140
160 Ser Val His Glu Met Ile Pro Trp Pro Leu Phe Val His Tyr Ala Ala
161 145 150 155 160
162 Ser Lys Gly Gly Met Lys Leu Met Thr Glu Thr Leu Ala Leu Glu Tyr
163 165 170 175
164 Ala Pro Lys Gly Ile Arg Val Asn Asn Ile Gly Pro Gly Ala Met Asn
165 180 185 190
166 Thr Pro Ile Asn Ala Glu Lys Phe Ala Asp Pro Glu Gln Arg Ala Asp
167 195 200 205
168 Val Glu Ser Met Ile Pro Met Gly Tyr Ile Gly Lys Pro Glu Glu Val
169 210 215 220
170 Ala Ala Val Ala Ala Phe Leu Ala Ser Ser Gln Ala Ser Tyr Val Thr
171 230 235 240
172 Gly Ile Thr Leu Phe Ala Asp Gly Gly Met Thr Lys Tyr Pro Ser Phe
173 245 250 255
174 Gln Ala Gly Arg Gly
175 260
177 <210> SEQ ID NO: 3 

178 <211> LENGTH: 11 

179 <212> TYPE: PRT 

180 <213> ORGANISM: Bacillus megaterium 

181 <400> SEQUENCE: 3 

182 Ala Met Arg Gly Ser His His His His His His 

183 1 5 10 

185 <210> SEQ ID NO: 4 

186 <211> LENGTH: 4193 

187 <212> TYPE: DNA

188 <213> ORGANISM: Bacillus megaterium / Haementeria ghilianii fusion

189 <220> FEATURE: 

190 <221> NAME/KEY: CDS 

191 <222> LOCATION: (141)..(344)

192 <223> OTHER INFORMATION: Tridegin 

193 <220> FEATURE: 

194 <221> NAME/KEY: gene 

195 <222> LOCATION: (1)..(4193)

196 <223> OTHER INFORMATION: plasmid pAW4 

197 <220> FEATURE: 

198 <221> NAME/KEY: CDS 

199 <222> LOCATION: (387)..(1169)

200 <223> OTHER INFORMATION: Glucose Dehydrogenase 

201 <220> FEATURE: 

202 <221> NAME/KEY: CDS 

203 <222> LOCATION: (1179)..(1211)

204 <223> OTHER INFORMATION: poly histidine tag 

205 <400> SEQUENCE: 4

206 ccatcgaatg gccagatgat taattcctaa tttttgttga cactctatca ttgatagagt 60
207 tattttacca ctccctatca gtgatagaga aaagtgaaat gaatagttcg acaaaaatct 120
208 agataacgag ggcaatcgat atg aaa cta ttg cct tgc aaa gaa tgg cat caa 173
209 Met Lys Leu Leu Pro Cys Lys Glu Trp His Gln
210 1 5 10
211 ggt att cct aac cct agg tgc tgg tgt ggg gct gat cta gaa tgc gca 221
212 Gly Ile Pro Asn Pro Arg Cys Trp Cys Gly Ala Asp Leu Glu Cys Ala
213 15
214 caa gac caa tac tgt gcc ttc ata cct caa tgt aga cca aga tca gaa 269
215 Gln Asp Gln Tyr Cys Ala Phe Ile Pro Gln Cys Arg Pro Arg Ser Glu
216 30 35 40
217 ctg att aaa cct atg gat gat ata tac caa aga cca gtc gag ttt cca 317
218 Leu Ile Lys Pro Met Asp Asp Ile Tyr Gln Arg Pro Val Glu Phe Pro
219 45 50 55
220 aac ctt cca tta aaa cct agg gag gaa agcgctatga gaggatcgca 364
221 Asn Leu Pro Leu Lys Pro Arg Glu Glu
222 60 65
223 tcaccatcac catcacctgc ag atg tat aca gat tta aaa gat aaa gta gtt 416
224 Met Tyr Thr Asp Leu Lys Asp Lys Val Val
225 70 75
226 gta att aca ggt gga tca aca ggt tta gga cgc gca atg gct gtt cgt 464
227 Val Ile Thr Gly Gly Ser Thr Gly Leu Gly Arg Ala Met Ala Val Arg
228 80 85 90
229 ttc ggt caa gaa gaa gca aaa gtt gtt att aac tat tac aac aat gaa 512
230 Phe Gly Gln Glu Glu Ala Lys Val Val Ile Asn Tyr Tyr Asn Asn Glu
231 95 100 105 110
232 gaa gaa gct cta gat gcg aaa aaa gaa gta gaa gaa gca ggc gga caa 560
233 Glu Glu Ala Leu Asp Ala Lys Lys Glu Val Glu Glu Ala Gly Gly Gln
234 115 120 125
235 gca atc atc gtt caa ggc gat gta aca aaa gaa gaa gac gtt gta aat 608
236 Ala Ile Ile Val Gln Gly Asp Val Thr Lys Glu Glu Asp Val Val Asn
237 130 135 140
238 ctt gtt caa aca gct att aaa gaa ttt ggt aca tta gac gta atg att 656
239 Leu Val Gln Thr Ala Ile Lys Glu Phe Gly Thr Leu Asp Val Met Ile
240 145 150 155
241 aac aac gct ggt gtt gaa aac cca gtt cct tct cat gag cta tct cta 704
242 Asn Asn Ala Gly Val Glu Asn Pro Val Pro Ser His Glu Leu Ser Leu
243 160 165 170
RAW SEQUENCE LISTING ERROR SUMMARY DATE: 12/11/2002

PATENT APPLICATION: US/09/913,494A TIME: 09:37:40

Input Set : N:\Crf4\12022002\I913494.raw
Output Set: N:\CRF4\12112002\I913494A.raw

Invalid Line Length:
The rules require that a line not exceed 72 characters in length. This
includes spaces.

Seq#:1; Line(s) 2
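
The 72-character limit cited in this error summary can be checked before a listing is submitted. The following is only a minimal sketch, not the checker that produced this report: the function name find_overlong_lines and the sample text are hypothetical, and the only rule taken from the report is that a line, spaces included, may not exceed 72 characters.

```python
MAX_LINE_LENGTH = 72  # limit quoted in the error summary; spaces count


def find_overlong_lines(text, limit=MAX_LINE_LENGTH):
    """Return (line_number, length) for every line longer than `limit`.

    Line numbers are 1-based. This hypothetical helper only measures
    raw line length; it does not validate any other listing rule.
    """
    overlong = []
    for number, line in enumerate(text.splitlines(), start=1):
        if len(line) > limit:
            overlong.append((number, len(line)))
    return overlong


# Made-up sample: the second line is padded to 86 characters.
sample = "<110> APPLICANT: Example\n" + "<120> TITLE OF INVENTION: " + "x" * 60
print(find_overlong_lines(sample))
```

Run on a listing file's text, a non-empty result points at the lines that would need to be wrapped, as was reported here for line 2 of the first sequence.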
VERIFICATION SUMMARY DATE: 12/11/2002

PATENT APPLICATION: US/09/913,494A TIME: 09:37:40

Input Set : N:\Crf4\12022002\l913494.raw 
Output Set: N:\CRF4\12112002\l913494A.raw 

L:6 M:271 C: Current Filing Date differs, Replaced Current Filing Date 
L:450 M:258 W: Mandatory Feature missing, <223> Blank for SEQ#:12, Line#:0